Using Support Vector Machines for Multicategory Cancer Diagnosis Based on Gene Expression Data
نویسندگان
چکیده
In an effort to contribute to the development of accurate cancer diagnosis based on gene expression data, this study performs a comprehensive evaluation of multicategory Support Vector Machine (MC-SVM) algorithms applied to the majority of cancer-related gene expression microarray datasets currently freely available to the scientific community. Our results show that: (a) MC-SVMs are very effective in performing accurate cancer diagnosis in high-dimensional gene expression data. The MC-SVM techniques by Crammer and Singer, Weston and Watkins, and one-versus-rest are the best methods in this domain. (b) MC-SVMs outperform other popular machine learning algorithms to a remarkable degree. (c) A prototype software tool which develops MC-SVM classifiers in a fully-automated fashion is introduced. Results produced by the tool compare favorably with previously published studies.
منابع مشابه
Classification of Multiple Cancer Types by Multicategory Support Vector Machines Using Gene Expression Data
MOTIVATION High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification method...
متن کاملA comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis
MOTIVATION Cancer diagnosis is one of the most important emerging clinical applications of gene expression microarray technology. We are seeking to develop a computer system for powerful and reliable cancer diagnostic model creation based on microarray data. To keep a realistic perspective on clinical applications we focus on multicategory diagnosis. To equip the system with the optimum combina...
متن کاملFeature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملA Comparison of SVM-based Evolutionary Methods for Multicategory Cancer Diagnosis using Microarray Gene Expression Data
Selection of relevant genes that will give higher accuracy for sample classification (for example, to distinguish cancerous from normal tissues) is a common task in most microarray data studies. An evolutionary method based on generalization error bound theory of support vector machine (SVM) can select a subset of potentially informative genes for SVM classifier very efficiently. The bound theo...
متن کاملGene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method
Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003